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We adapt the techniques in Stigler [Ann. Statist. 1 (1973) 472-477] to obtain a new, general 
asymptotic result for trimmed [/-statistics via the generalized //-statistic representation intro- 
duced by Serfling [Ann. Statist. 12 (1984) 76-86]. Unlike existing results, we do not require 
continuity of an associated distribution at the truncation points. Our results are quite general 
and are expressed in terms of the quantile function associated with the distribution of the U- 
statistic summands. This approach leads to improved conditions for the asymptotic normality 
of these trimmed [/-statistics. 
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1. Introduction and statement of results 

Stigler [23] developed an asymptotic result for the trimmed mean without requiring conti- 
nuity of the underlying distribution function associated with the observations. This result 
was extended to non-degenerate [/-statistics based on trimmed samples in Borovskikh 
and Weber [4] . An alternative method for developing robust versions of U -statistics is to 
consider the statistic formed by trimming the kernel values, rather than the observations 
upon which the statistic is based. This idea is discussed in, for example, Serfling [18], 
Choudhury and Serfling [7] and Gijbels, Janssen and Veraverbeke [10]. In this paper, 
we use the generalized L-statistic representation developed in Serfling [18] to obtain an 
asymptotic result for trimmed /7-statistics under quite general conditions. We will not 
require continuity of the relevant, associated distribution at the truncation points. 

Let X, X\, . . . , X n be independent identically distributed random variables, taking val- 
ues in a measurable space (X, 23(X)) and having common distribution F. Let h be a 
symmetric function from X m to R and denote by Hp the right-continuous distribu- 
tion function of the random variable h(Xi, . . . , X m ). Set N = ( n ) and let hi, . . . ,/ijv 
be an enumeration of the values of h(X^, . . . , Xi m ) taken over the N m-tuples in 
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&nm = {{iii ■ ■ ■ i*m) : 1 < ii < • • • < hn < n}. Note that these random variables hi are, 
in general, dependent. Let h n \ < ■ ■ ■ < h n N denote the ordered values of hi,..., hpj. 

The original [/-statistic is defined as an average taken over the N possible outcomes 
h(Xi 1 , . . . ,Xi m ), 1 < ii < ■ ■ ■ < i m < n, that is, 



-1 n ,. 

(") ^/i(X n ,...,X lm ) = 7V- 1 ^/ lm = y xdH n (x), (1) 



where the empirical distribution function H n [x) of [/-statistical structure is defined by 
H »to=(Z) 12 I {KX il ,...,X i J<y}, yeR, (2) 



and I {A} denotes the indicator of the set A. For any < 7 < 1, let 7V 7 = [jN], where 
[a] denotes the largest integer less than or equal to a. If < a < j3 < 1, then put N a p = 
Np — N a . The trimmed versions of U are based on trimming the second sum in (1), 

i=N a + l 

or on trimming of the range of integration in (1), 



L aj3 = I xdH n {x), (4) 
J[h a ,hf>) 

with h a = h n ff a and hp = h^^, where iV 7 = —[—77V], 7 = a, (3. For the results that fol- 
low, it is important to note that the lower bound for the integral in (4) is included and the 
upper bound excluded. This is critical since H n is a step function. With this constraint, 
we are able to obtain the asymptotic distribution of L a p without imposing any conditions 

on the nature of Hp. In Lemma 2.3, we show that L a p = /V -1 w 1 ^m- Thus, U a p 
and L a p differ in terms of their divisors, and there arc possible subtle differences in the 
number of summands. 

A class of generalized L-statistics, which includes (3) and (4), was introduced by Ser- 
fling [18]. The trimmed [/-statistics (3) and (4) are directly connected with generalized 
Lorcnz curves, which are important in financial mathematics (see, for example, Goldic 
[9], Helmcrs and Zitikis [13]). 

Clearly, H n (y) is an unbiased estimator of Hp{y). In the case m = 1 and h{x) = 
x, H n reduces to the usual empirical distribution function. Define the left-continuous 
quantile function Hp X {t) = inf{y e R : H F {y) > t], < t < 1, #^(0) = H F 1 {0+), for any 
distribution function Hp. The empirical quantile function has the form 

n r •_ 1 • i 

H-\t) =E^ 7 l ~JT < 1 ^ JJ b H n X { Q ) = h nl- 

i=l ^ > 
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A large number of authors have studied the weak convergence of such L-statistics in 
the case m = l,h(x) = x. A partial list consists of Chernoff et al. [6], Bickel [2], Shorack 
[19, 20], Stigler [23, 24], Csorgo et al. [8], Griffin and Pruitt [11], Cheng [5], Mason 
and Shorack [16]. For m > 2, under various sets of regularity conditions, asymptotic 
normality of various types of generalized L-statistics has been investigated by Silverman 
[21], Serfhng [18], Akritas [1], Janssen et al. [15], Helmers and Ruymgaart [12], Gijbels 
et al. [10] and Hossjer [14]. 

In the aforementioned papers, for m > 2, the results always assumed that Hp is con- 
tinuous or smooth. However, in modern statistical robust procedures and for bootstrap 
procedures, results allowing for the discontinuity of the underlying distribution function 
Hp are needed. We study the asymptotic behavior of U a p and L Q( g for any Hp without 
imposing the requirement of continuity. 

The conditions of our theorem and the limit random variable are defined via the values 
of quantile function Hp 1 at the points a and (3. Existing results handle the cases where 
Hp 1 (j+) — 11^ (7), 7 — a, (3. Our main result is derived without this assumption of 
continuity. We represent the trimmed [/-statistic as a sum of classical [/-statistics with 
bounded, non-degenerate kernels plus some smaller terms and then we apply standard 
results to such statistics. 

For convenience, in what follows, for the distribution function Hp, we denote the 
smallest quantile Hp 1 ^) and the largest quantile Hp 1 ( r y+) as, respectively, 

£" :=M{x€R:H F (x)>~f}, 
£+ :=sup{x €R:H F (x)<j} 

and A£ 7 = — £~ with 7 = a, j3. Let 

N N 

7V± = J2 lib N± = £ I{hi 

i=l i=l 

Note that 

L[ n (^-)=iV- 1 7V 7 ± , H n ($)=N- 1 N± (5) 
and LT~ 1 (7) = h n ff arc valid for all < 7 < 1 and the following events coincide: 

{H- 1 ( 1 )>x} = { 1 >H n (x)} 1 {i/- 1 ( 7 )<4 = {7<i/ Il ( a ;)}, xeR. (6) 
Introduce the functional 8 = 9 (Hp), where 

[((* - <^}+(3^)~((x- &)I{x < £} + oC+)] dHp(x) 

and the following functions with i£X: 

g(x) = [EI{h(x, X 2 , . . . , X m ) < tg}(h(x, X 2 , . . . , X rn ) - ft) + ^r] 
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- [EI{h(x, X 2 ,..., X m ) < &}(h(x, X 2 ,...,X m )-f+) + a£] - 0, 
g a (x)=EI{h(x,X 2 ,...,X m )<i+}-e a , 8 a = H F (&-), (7) 
gp{x) = EI{h(x,X 2 , ...,X m )<^}- 9 f3 

= l-9f,-EI{h(x,X 2 ,...,X m )>tp}, 6 =H F (£p). 

Note that for all < a < (3 < 1 and x G X, we have \g(x)\ < 4(|£+| + |^|). 

Let =Eg 2 (X),o* a =Eff t a {X),^ = Egj(X),c gga = Eg(X)g a (X),c ggfi = Eg(X)gp(X) 
and c g a gf) =Eg a (X)g p (X). 

Theorem 1.1. If a g > 0, then for any underlying distribution function Hp, we have 

^^-y/n{U a p -6)^t 9 ~ A£ a I(T a > 0)r Q - A£pl(rp < G)t p , 

where (T a ,T g ,Tp) is a trivariate Gaussian random vector with mean vector zero and co- 
variance matrix 

2 

2 

C 99a a g c 99fi 
2 

Corollary 1.2. For any underlying distribution function Hp, we have, when a 2 . >0, 

^-(L a p -S)At } - A&Jfo > 0)r Q - A&Jfo < 0) Tp , 

where {r a ,T g ,Tp) is a trivariate Gaussian random vector defined as in Theorem 1.1. 

Corollary 1.3. Suppose that the quantile function Hp (x) is continuous at the points 
a and f3. If a 2 > 0, then 

Vn(U a p -0) — > T g . 

m 

For the simple case m = 1, the functions in (7) reduce to 

g(x) = < h(x) < g}h{x) - EI{C+ < h(X) < g}h(X) 

g a (x)=I{h(x)<£+}-9 a , 

gp(x) = I{h(x) < g} -0p = l-9 p - I{h(x) > ^}. 

A useful application of the theorem for the m = 2 case is for the kernel h(x,y) = ^(x — 
y) 2 . This provides the asymptotic behavior of a natural, alternative robust version of 
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the sample variance. We will now develop explicit expressions for the terms in a more 
interesting example. 

Example. Let h{x\, . . . , x m ) = max{xi, . . . , x m } with m > 2 . Let F(t) be the distribu- 
tion function of X { and let Y = max{X 2 , . . .,X m }. Then H F (t) = (F(t)) m and 

g(x) = g a p(x)+€£g a (x) -£pgp(x), 

g al3 {x) = EI{£+ < max{x,F} < £j}max{ar,Y} - / ydH F (y) 
= <x< ^(Fix))™- 1 - f yiFiy))™- 1 dF(y) 

+ [ (I{x<y}-F(y-))yd(F(y)r-\ 

g a (x) = I{x < timti-))™- 1 (F(£-)) m , 
gp(x) = I{x < CjKFitj))™- 1 - (F(^)r. 

In addition, 

a 2 g = Egl p (X) + E[Z+g a (X) - ^g (X)} 2 
+ 2Eg aP (X){ti+g a (X) - $gp{X)}, 

Eg a {X)g p (X) = (F{£-)F{£j)) m - 1 F{£-){1 F(^)), 

771—1 



Eg aP (X)g a (X) = (F(£-)) m / (1 - F(y-))yd(F(y)) 



Eg aP {X)g fi {X) = {F{C p )) m -\l-F{C )) y{F{y)T^ &F{y) 



+ (i ? (^)) m - i (l - F(^)) / F(y-)yd(F(y)r-' 

Consider the distribution function 

F{t) = 2tl{0 < t < ±a 1/m } + a lfm I{\a 1 ' m < t < a 1/m } 

+ tl{a 1/m < t < /3 1/m } + ^^ n I{f3 1/m < t < 2/3 1/m } 
+ hl{20 l ' m < t < 2} + /{t > 2}, teR. 
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F(C-) = « 1/m , F(t)=t, t€[a^ m ,0 m ], a 2 g >0 
and the limiting behavior is given by 

<MV Q p -fl)Ar r ^ m I(r a > 0)r a - f3^ m I(T P < 0) Tp . 
However, for the simpler distribution function 

F(t)=tl{0<t<l}+I{t>l}, teR, 

we have 

C = C = « 1/m $ = $ = P 1/m , 

*"(£-) = « 1/m . F(^) = /3 1 /™, F(t)=t, tela 1 /™,^" 1 ], o* > 
and we get the asymptotic behavior covered by Janssen et al. [15], 

- — -Vn(U a p -6)-^Tg. 
m 

2. Proofs 

The following two lemmas are key results for the proof. 
Lemma 2.1. The following representation holds: 

E h ™ = E tp}** + - N °) - & ( n p - N ?) 

i—N a + 1 i=l 

- A£ a /{iV Q < iV+}(iV+ - JV a ) - &tpI{Nj < N p }(Np - Np) 

+ L Q +L (3 , (8) 

where L Q = J Q — J a wit/i 

J Q =/{iV Q <iV+} E (kw-C). 4 = /W<^a} E (^-C) 

i=N a + l i=N+ + l 
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and hp = Jp — Jp with 



i=N„+l 

N/3 

Jp = I{Np<N fi } (h ni -$)- 

i=N~ + l 

Proof. For i = 1, . . . ,N, write 

h ni = (h ni + A£ a )I{h ni < £+} + h ni I{£+ < h m < Cp} + (h m - A£p)I{h ni > 

Since h nl = h m + A£_ a I{h m < £+} - A£pl{h ni >^}, I{h m < £+} = I{i < N+} and, by 
(6), I{h n i > £7} — I{i > N7}, we can write 

h ni = J2 h m -A(; a I{N a <N+}(N+~N a ) 

i=N a + l i=N cc + l 

-A&I{Np<Np}(Np-N ). (9) 

Note that h nfj + < < h n<] ^+ +1 and h nN - < ^ < h n>N - +v From (6), we have < 
Ki <Zp}= I{NZ < i < Np}. Hence, in (9), 

Np N- Na 

h ™ = h m-I{N+ <N a } J2 h ™ 

i=N a + l i= N+ + i l= N+ + l 

+ I{N a <N+} (hni + A£ a )-I{Np<Np} ^ h ni 

i=N a + l i=Np+l 



I{Np<Np} (hm-AZp) 



N 



i=l 

+ L a +hp. (10) 
Equation (8) follows from (9) and (10). This proves Lemma 2.1. □ 
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Figure 1. Plots of H(-) with £ = (a) H(g-) = 7 = H(g); (b) < 7 = 

(c) H(g-)<y<H(g). 

Lemma 2.2. iVoie that 

Np N 

N- 1 h ni = N- 1 J2H^<hi<^}h i + ^+(H n ^+-)-a) 

i=N a + l i=l 

- ^(H n (g) -0)- A£ a I{N a < N+}(H n (t;+-) - a) 

- A^I{N- < Np}(H n (g) n-^Qn, 

where g„ — > in probability as n — > 00. 

Proof. We shall estimate L Q and L^, taking into account the values of the distribution 
function Hp(x) at x = with 7 = a, /3. Figures 1 and 2 illustrate the different situations 
that need to be considered. 




Figure 2. Plots of H(-) with £~ < (a) 7 = H(&) = £T(£+); (b) £T(^-) < 7 = #(£7) = 

( C ) ff(^-)= 7 =fl-(e+-)< J ff(^+). 
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Estimating L a . Noting that > h ni } = I{i < N+}, we write 
J a =I{N a <N+} £ (/i™ 

i=JV Q + l 

= I{N a <N+} ]T (^-^/{C^i <t<i\£} 

-/{iV Q <7V+} ]T (Z*-h ni )I{Z->h ni }I{i<N-} 

= J + -J- 

It is clear that if £~ = then J a — a - s - Let £~ 7^ as is the case in Fig. 2. In this 
case, Hp(£~) =ol = ifp(£+— ) and we can write 

0<#<J{iV-<JV+}(£-£-) £ 

»=JVa +1 

TV 

<Ae a 5^/{e-<hi<f+}=0 a.s. 
t=i 

since < ^ < £a } = Hf(£o~) ~ Hf(€<x) = ^- Hence, we always have the relation 

J+ = a.s. To estimate J~, we note first that if N£ > N~ , then the indicator I{i < 
N~} = for all i = N~ + 1, . . . , N+. Therefore, we have the inequalities 

0<J-<I{N a <N-} ]T (f- - h ni )I{^ > h ni ] 

< I{N a < N~}(N~ - N a )(£ ~ hnN a )I{Za > h nN a }- (11) 

Further, we shall apply the technique used in Smirnov [22] with a probability inequality 
from Hoeffding [14] (or see, for example, Serfiing [17], pages 75 and 201). Thus, 

P{(& ~ h nN a > e) n (C > h nNa )} 

= P{i/„(^-£)>iV" 1 ^a} 

= P{H n (£- -e)- if (C - e) > - H(£- - e)} 

< Cl exp{-c 2 ne 2 a ^-,e)} (12) 
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with some positive constants c\ and ci , depending only on m and 9 a (£ Q , e) = a — 
Hp(£~ — e). Further, a (^~ , e) > for any small values of e > 0, by the definition of the 

smallest a-quantile £~. Under the conditions of the lemma, \/nN^ 1 (N ^ — N a ) — ^ r" 
as n — > oo. Hence, y/nN^ 1 J a — > in probability as n-> oo. 

Next, we consider J a . By definition, iV+ < N+ and since I{h ni < £+} = /{i < iV+} 
and /i nA r+ < < AT++1' ^ follows that the indicator I{h ni = £+} = 1 for i = iV+ + 
1,...,N+ and we can write 

0<J a = I{N+<N a } 2 (^i-^{^™>^} 

= /{^V+<iV Q } £ (*»< 
i=Ar+ + l 

< I{N+ < N a }(N a - N+)(h nNa - &)I{h nNa > £+}. (13) 

In (13), we need to consider two cases: Hp(£a) = a an d a < -^-F (£<*)■ ^ n * ne ^ rs t case, 
Hp(^a) = a an d we nave the weak convergence • s /nA r " 1 (A^+ — iV Q ) — — » r+ as n oo 
and the following estimates which are similar to (12): 

P{{hnN a - £ > e) n (/i„jv„ > C)} 
<P{/i„JV a -C>£} 
= P{AT 1 iV a >#„(£++£)} 

= P{F(£+ + e) - H n (& + e) > H(£+ + e) - iV" 1 ^} 
<c 1 cxp{- C2 n^(^+,e)}, (14) 

where <5a,(£+,e) = Hf(£+ + e) — a. In addition, 5 Q (£+,e) > for any small values of e > 
because of the definition of the largest a-quantile £+. Hence, in the case Hp(^) = a, 
we have y/nN~ 1 J a — > in probability as n —> oo. In the second case in (13), 5 Q (£+,0) = 
Hp(££) — a > and we have the representation 

V^iNT W - N a ) = MHn(ti) - H F (£)) + V^6 a (C, 0) + w„(a), (15) 

where V^(#„(£+) - A t+ and w n (a) = ^N~ l {aN - [aTV]) = Ofa" 1 / 2 ) as 

n —} oo, but the positive term y/riS a (^, 0) is unbounded. Therefore, in this case, we shall 
apply the estimate (14) with en -1 instead of e, that is, P{(h n N a — > en" 1 ) n (/i n jv a > 
^ c i ex P{~ c 2"-<5a(Cft: en" 1 )}. Since the distribution function i/p is continuous from 
the right at the point it follows that S a (^,0) < 5 a (£+,£n _1 ) for any small e > and 
sufficiently large n. Hence, in the second case, a < Hp(£+) and (14) is replaced by the 
inequality 

P{(hnN a - £>en- 1 )n(h nNa >C)}<ciexp{-c 2 n^^+ j0 )}, (16) 
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which provides the desired convergence y/nN -1 J a — > in probability as n — > oo. Thus, 
we have proven that ■y/nN~ 1 I^ a — > in probability as n — > oo. 

Estimating hp. Noting that I{£p > h n i} = I{i < N^}, we write 

N i 

0<-Jp=I{Np<N^} (Cp -h ni )I{Cp>Ki\ 

i=Np + l 

< I{N P < Np}(Np - N p )(g - h nNfl )I{g > h nNfl }. 
Here, by analogy with (12), we have 

P{(Zp ~ K Np >e)D (£~ > h nNp )} < P{^ - h nNl) > e} 
= P{F„(^-e)>^V- 1 iV /J } 

= P{H n (g - e) - H(£j -e)> N^Np - - e)} 
<c 1 exp{-c 2 n9l(^,e)} (17) 
with Op(£p ,e) = (3 — Hp(£7 — e) and by analogy with (15), 

V^N~\Np - N P ) = yfii(H n (tp) -H F (£j)) + yft0f,(tj,O)+cj n (0). (18) 

Here, we need to consider two cases: (3 — Hp(£7—) = and f3 — Hp(£7—) > 0. In the 
first case, we apply the inequality (17) with sufficiently small e > 0. In the second case, 
we use (17) again, but with parameter en -1 , as in (16), to get 

^{(^-^>m" 1 )n(f^>^)}<ciexp{-c a nfl|(^-,0)} (19) 

since the distribution function Hp has a limit from the left at the point £J and 
Hp(£,p—) > Hp(t;p — en -1 ). In this result, we have ^fnN" x J^ — > in probability as 
n — > oo. 

Finally, we consider Jg. Since I{h n i > £^ } = /{i > N^ 1 }, we write 

AT/3 

J fi = I{Np<N fi } £ {h n t-$)I{hni>$} 

= -I{Np<N fl } -hm)I{^ <h m <^}I{N^ <z<N+} 

+ I{N-<N P } J2 i.Ki-^)I{h ni >^}I{i>N+} 

= —Jp + Jp . 
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If £p = £p, then Jp = a.s. Now, assume that ^ j£ £^ . In this case, Hp(^ ) = (5 = 
Hp(£p—) and we have 

JV 

< A^^/{g</ii<^} = a.s. 

i=l 

since EI{£j < hi < £^ } = Hp{^—) — Hp(^) = 0. Hence, we always have Jp = a.s. 
To estimate Jj~ , we write 

< 4 < 7{JV^ < - - £+)/{/i™ > ^} 

and apply the estimates (13)— (16) with f3 instead of a. We have ^/nN -1 ,]^ — » in prob- 
ability as n — > oo and hence y / nA^ _1 L ( g — > in probability as n — > oo. 

This proves Lemma 2.2. □ 

Proof of Theorem 1.1. Let C/(g) be a [/-statistic of the form (1) with the kernel 
g{x 1 ,...,x m ) = [I{h(x 1 ,...,x m ) <£p}(h(x 1> ...,x m ) 

- [I{h(x 1 ,...,x m ) < €£}(h(xi,...,x m ) -£+) + £*£+]. 

We see that 

2V 

[/(<?) = N- 1 YJ <hi< $}hi + -a)- Cp(H n (C p ) ~ /?)• 

i=l 

It is not difficult to verify for this function that Eg(Xi, . . . , X m ) =6 and g{x) = 
Eg(x, X2, . . . , X m ) — 8, x £ X; in addition, Eg 2 (X) > 0, by the condition of the theorem. 
Hence, the kernel g is non-degenerate and, by the central limit theorem for [/-statistics 
with such bounded kernels, we have the weak convergence T n g : — jti ^y/n{U (g) — 6) — )> Tg 
as n — > oo (see, for example, Borovskikh [3]). By the same central limit theorem, we have 

and 

T n /3 ■= m-^-y/ntHn^-) - H(£~)) —> T fj 

as n — > oo. Under the conditions of the theorem, we have E\I{N+ — N a > 0} — I{r a > 
0}| -> if A£ a ^ (in this case, #(£+-) = a) and E\I{Ng -N <O}- I{t p < 0}| -> 
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if A£a 7^ (in this case, H(£g) = [3). Further, it is easy to prove that the covariances 
Cov(r n *, T n *) — > Cov(t*,t*) as n — > oo, where *,* = a,g,/3. Now, apply Lemma 2.2 to 
complete the proof of Theorem 1.1. □ 

Lemma 2.3. The following representation holds: 



ni • 



Proof. By definition, we can write 

L a p= / I{h a <x < hp}xdH n (x) 
Jr 



IR 

N 



lr ^ I {h a < h m < hp}h ni 
t=i 

1 N 1 N 

— ^ !{ h ni < h fj }h m - — ^ I{hni < h a }h r , 



i=l i=l 
-i Np-l N a -1 

i=l i=l 



iV»-l 



i=JV Q 

This proves Lemma 2.3. □ 
The proof of Corollary 1.2 follows from Theorem 1.1 and Lemma 2.3. 
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